CDS

Accession Number TCMCG061C05134
gbkey CDS
Protein Id XP_042037085.1
Location complement(join(2474546..2474710,2474797..2474892,2474975..2475076,2475306..2475524,2475612..2475755,2476383..2476535,2476906..2477127,2477424..2477706,2477810..2477985,2478065..2478370,2478516..2478572,2478662..2478933,2479297..2479371,2479628..2479985,2480057..2480322,2480891..2481074,2481466..2481543,2482257..2482319,2482567..2482665,2483030..2483134))
Gene LOC121783120
GeneID 121783120
Organism Salvia splendens
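
The Location field uses the GenBank feature-location syntax: complement(join(...)) around a list of start..end segments with 1-based, inclusive coordinates. As a minimal sketch (the helper name `parse_location` is hypothetical, and only this simple complement/join form is handled), the segments can be parsed and their lengths summed to cross-check the record:

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(2474546..2474710,2474797..2474892,2474975..2475076,"
    "2475306..2475524,2475612..2475755,2476383..2476535,2476906..2477127,"
    "2477424..2477706,2477810..2477985,2478065..2478370,2478516..2478572,"
    "2478662..2478933,2479297..2479371,2479628..2479985,2480057..2480322,"
    "2480891..2481074,2481466..2481543,2482257..2482319,2482567..2482665,"
    "2483030..2483134))"
)


def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple GenBank location.

    Hypothetical helper: it ignores partial markers (< and >) and remote
    references, which do not occur in this record.
    """
    strand = -1 if location.startswith("complement(") else 1
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


strand, segments = parse_location(LOCATION)
# Coordinates are inclusive, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)
print(strand, len(segments), cds_length)
```

The 20 segments on the minus strand sum to 3423 nt, that is 1141 codons: 1140 residues plus the stop codon, which agrees with the Length field in the Protein section.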

Protein

Length 1140 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA737421
db_source XM_042181151.1
Definition protein ALWAYS EARLY 3-like isoform X2 [Salvia splendens]

EGGNOG-MAPPER Annotation

COG_category BDT
Description SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001
KEGG_ko ko:K21773
EC -
KEGG_Pathway ko04218, map04218
GOs -

Sequence

CDS:  
ATGGGACCCACAAGAAAGTCGAGAAGTGTGAATAACCGCTATTCTTATATCGACGATGTATCTCCCAGCAAAGATGGATATAATGCTAAAAAAAGTAACAGTCGGAAAAGAAAGTTGTCTGACATGCTTGGGCCTCGGTGGACCATGGAAGAGCTGACTCGTTTTTATGACTCTTACCGCAAGTACAGTAAAGAGTGGAAGAAGGTAGCTGGTGCTGTCAGAAATCGTTCAGCAGAAATGGTGGAGGCCCTTTACACTATGAATCGGGCATACTTATCTCTTCCACATGGAACTGCTTCTGCAGCTGGGTTAATCGCCATGATGACTGATCACTACAGCAATCTGGCAGGCAGTGATAGTGACCAAGAGAAGAATGGTGGAGCAGGATCATCCCGGAAGACACAGAAGCGTGCTCGTGGGAAAGTGGAGCCCCCCACCCCTAAAGCATTAGATGACATTGCTTCACAATCTCTAATCACTGCATCAAATTATGGCTGCCTTTCATTATTGAAAAAGAATCGCTCTAGTGTTAGCAGAGCTCGTCCTGTCGGAAAAAGAACTCCAAGGTTCCCGGTTTCATTTTCACGTGAAAATATCAACGGGGAGAAATTTTTTTCTCCAACAAGGCAGGGCCTGAAATTAAAGGCCAATGCTGATGACAATGAAGTAGCTCATGAGATAGCTATAGCTTTGGCAGAGGCCTCCCAGAGAGGTGGTTCCCCCAAGGTTTCTGGTACACTGAATAAAAGAGCTGAAAGTGTAATGTCCTCACCTTTTAGGCATGCTCAAAGAAAGCATAATATAGCAGAGATATCAAATGGCAAGCTCTTGGCAACAGACACAGATGAAGAAGATTTAGAAGGAAGCACAGAAGCCGACACTGGTGAATTATCAAGGTGTAAACCCCATTTGATTGAATCTGTAAGTATTGGTACAACGAGACAGAAGGGAAACAAATTTGAAGGGAGAAAGCATGAAGTTGATAACAATATTGAGAATCATTTGGATGACATCAAGGAAGAATGCAGTGGAACAGAGGAAGGTCAGCAACTGCTGAGTGCAATGAGAAAAAAGTTTGATGTGGAGGTCAATGACACCAAGGTGTCCAGATCTCCGATGCTGAAGAAGAGCAAACAGGTTCTCTTTGGCAGAGATGAAGAACCTGCTTTTGATGCCCTGCAAACACTGGCGGATTTGTCACTGATGATGCCAACAGATCATGAATATGATTCCAGGATGCTGTTCAAAGATGATCATGATTATGTTGACGAGACTGTGCCATTAGAATCTCCACCGGCAAACCAGCCAAGAGAAAAACGTAGATCCTCAGGGGTAAGAATGAAAGGACACCTAGCATCAAGTGTTGAAGTTGCTTGTAGCAAAACATCAAAACCTGGAAATAGTTCAGTTTTTCATGGTAGTTCTGTTCCTGAAGAAAACCAGGATTCCCATCAGTCTACCACCAAATCAAGAAAAAAACAAAAGATGCCAGTGCATAAGAATAAAAAAACTGAAGTTCATGCTGAAATTCATCTTAGTGGATCTCCAGGGGTTGAGGCTGGAGATGCAAGCAAGAAATCAATACGGAGTAAAAAGTATTCTCAAAGTAGTTCTCCGAAAGTGATGAAAGTTTCAGAAAACTCTTCTGGTGCTGATTTACAGAAAGAAGGGTCTGACTCAGCCCACTCAGCTACACAGGTTCTGGTCAATCAGGTTAATTTACCTACTAAAATGAGGAGCCGGCGTAAGCTGAGTCTTAAAAAACCATTGGTCAAGAAGGACCTGAAATTTTCTGAAAAGATCTCAAACAATGAGAGTAATCTTTCTCTTGGTTCACTCCATGATACAGTACCTAGATTTAAGAACCTGTCTAATTGTTTAGCGAACCAGCGCCTGAGGAGATGGTGTACTTATGAATGGTTCTACAGTGGCATTGATTATCCATGGTTTGCAAAGAGGGAGTTTGTTGAATATCTGTATCATGTTGGATTGGGTCATGT
ACCGAGATTAACTCGTGTAGAGTGGGGTGTCATAAGAAGTTCACTTGGTAAACCACGGCGATTTTCAGAGCAGTTCCTGAAGGAAGAAAAAGAGAAGCTTAATCAGTATCGAGATACTGTTAGAAAACATTACTCTGAGCTGCGCGAAGGTCTCAGGGATGGACTACCGAATGATCTTGCAAGGCCTTTATCAGTTGGTCAGCGTGTCGTTGCTATTCATCCAAAAATGAGGGAGATTCATGATGGAAGTGTGCTAACTGTTGATCACTCGAAGTGTCGGGTTCAATTTGACCGTCAAGAGCTAGGGGTTGAATTTGCCATGGATATTGACTGCATGCCTGTAAATCCATTTGAGAACATGCCTGCAATGCTTGGCAGACACACAATCACTGCTGAAATGTTTGAGAGCTTCAATCAACTGAAAGAGAATGGACGAGCAAAGGAGTACATTAAGCTTTCTCCTAGTGACAACCTGGATAATATGGATGATATTTCTGCTCTATCTTTATTAGCTAATCCTGGCAGTTTGCTGAAGCAATCAAAGGTTGCTTCAGATGTTCCAACAAGGCCGGGAACTGGTGATAACGCAAGGTACCAGCAGACATATTCTCAGTCCAGTACACCACTTGCTCAAATCCAGGCAAAGGAAGCTGACATTCAAGCTCTTGCACAACTAACTCGTGCCCTCGATAAAAAGGAAGCTATTGTTTGTGAATTAAGGCTTATGAATGATGATGTGTTGGAAAGTCAGAAGGATGGTGATAGCCCTTTAAAAGAATCGGAACCCTTTAAAAAGCAATATGCTGCGGTGCTTATACAATTGAATGAAGCCAATGAACAGGTCTCTTCAGCTTTACACTGCTTGAGAGAACGAAACACGTATCAAGGAAAATTTCCACCTGCATGGCCGAGGCCACTGCTTGGTCTTGCTGATCCTGGTAACACATTCAATTCTTTTGACCGTTCTGCATGTCAAACTCAAGAAGTGGGATCACATGTGAATGAAATCATGGATAGCTCCAAAACAAAAGCTCGGACTATGGTCGATGCTGCATTGCAGGCGATATCATCACTGACGAATAGGGATGGTACCATGGAGAAGATTGAGGAAGCCATAGACTACGTAAATGACCAGCTTTCCTCGGATGATTCCTGCATGCCAAGAACTGATCCTAAATCCTCAAATGCATCTGACATTGAGGCCCAAATTCCCTCCGAGCTGATCACAAAATGTGTAGCAACTTTGCTAATGATTCAGAAGTGTACCGAACGACAGTTTCCTCCATCCGATGTGGCAAAAATACTAGATTCTGCTGTGACAAGCCTACAGCCACGAAGTTCGCAAAACCTTCCTGTTTATACCGAGATACAGAAGTGTGTGGGCATCATCAAGAACCAGATATTGGCGCTAATACCTACTTAG
Protein:  
MGPTRKSRSVNNRYSYIDDVSPSKDGYNAKKSNSRKRKLSDMLGPRWTMEELTRFYDSYRKYSKEWKKVAGAVRNRSAEMVEALYTMNRAYLSLPHGTASAAGLIAMMTDHYSNLAGSDSDQEKNGGAGSSRKTQKRARGKVEPPTPKALDDIASQSLITASNYGCLSLLKKNRSSVSRARPVGKRTPRFPVSFSRENINGEKFFSPTRQGLKLKANADDNEVAHEIAIALAEASQRGGSPKVSGTLNKRAESVMSSPFRHAQRKHNIAEISNGKLLATDTDEEDLEGSTEADTGELSRCKPHLIESVSIGTTRQKGNKFEGRKHEVDNNIENHLDDIKEECSGTEEGQQLLSAMRKKFDVEVNDTKVSRSPMLKKSKQVLFGRDEEPAFDALQTLADLSLMMPTDHEYDSRMLFKDDHDYVDETVPLESPPANQPREKRRSSGVRMKGHLASSVEVACSKTSKPGNSSVFHGSSVPEENQDSHQSTTKSRKKQKMPVHKNKKTEVHAEIHLSGSPGVEAGDASKKSIRSKKYSQSSSPKVMKVSENSSGADLQKEGSDSAHSATQVLVNQVNLPTKMRSRRKLSLKKPLVKKDLKFSEKISNNESNLSLGSLHDTVPRFKNLSNCLANQRLRRWCTYEWFYSGIDYPWFAKREFVEYLYHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEKEKLNQYRDTVRKHYSELREGLRDGLPNDLARPLSVGQRVVAIHPKMREIHDGSVLTVDHSKCRVQFDRQELGVEFAMDIDCMPVNPFENMPAMLGRHTITAEMFESFNQLKENGRAKEYIKLSPSDNLDNMDDISALSLLANPGSLLKQSKVASDVPTRPGTGDNARYQQTYSQSSTPLAQIQAKEADIQALAQLTRALDKKEAIVCELRLMNDDVLESQKDGDSPLKESEPFKKQYAAVLIQLNEANEQVSSALHCLRERNTYQGKFPPAWPRPLLGLADPGNTFNSFDRSACQTQEVGSHVNEIMDSSKTKARTMVDAALQAISSLTNRDGTMEKIEEAIDYVNDQLSSDDSCMPRTDPKSSNASDIEAQIPSELITKCVATLLMIQKCTERQFPPSDVAKILDSAVTSLQPRSSQNLPVYTEIQKCVGIIKNQILALIPT
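
The protein sequence can be checked against the CDS by translation. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses a hypothetical helper named `translate`. It is applied only to the first 36 and last 12 nucleotides of the CDS shown above. The CDS text is wrapped over more than one line, so the lines must be joined before translating the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons and last 4 codons of the CDS in this record.
head = translate("ATGGGACCCACAAGAAAGTCGAGAAGTGTGAATAAC")
tail = translate("ATACCTACTTAG")
print(head, tail)
```

The two fragments translate to MGPTRKSRSVNN and IPT, which match the start and end of the protein sequence above.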